18.785: Analytic Number Theory, MIT, spring 2007 (K.S. Kedlaya) 
Small gaps between primes (after Goldston-Pintz-Yildirim) 

In this section, we introduce the strategy initiated by Goldston-Yildirim, and carried out 
by them and Pintz, for proving new results on the existence of short gaps between primes. 
Some calculations are postponed to a later unit. 

References: much of this unit is liberally plagiarized from K. Soundararajan, Small gaps 
between prime numbers: the work of Goldston-Pintz-Yildirim, Bull. Amer. Math. Soc. 
44 (2007), 1-18. For more details (which will be plagiarized later), see D.A. Goldston, 
Y. Motodashi, J. Pintz, and C. Yildirim, Small gaps between primes exist, Proc. Japan 
Acad. Ser. A Math. Sci. 82 (2006), 61-65. (Both references are available online, e.g., via 
MathSciNet.) 

1 The target theorem 

Let p n denote the n-th prime. As noted in the previous unit, we can use a probabilistic 
model to make plausible predictions about the ratio (p n +i — Pn)/(}ogp n ), by supposing that 
ir(x + y) — 7r(x), for x large and y ~ A logx, obeys a Poisson distribution with parameter A. 

What we will prove is a rather crude assertion consistent with this model. (Before this 
work, this was only known for e ~ 0.24.) 

Theorem 1 (GPY). For any e > ; there exist infinitely many p n such that p n+ i — p n < 
elogp„. 

Goldston et al also get a quantitative version of this, and they can even do this with 
Pn+i — Pn < (logp n ) 1 ~ <: for some specific e > 0. For simplicity, I won't get into these 
improvements. But I will discuss the following, whose proof is a good setup for the proof of 
Theorem 1. 

Theorem 2 (GPY). Assume the Elliott- Halberstam conjecture for Q = x e with any fixed 
9 > 1/2. Then there exists c = c{6) such that there exist infinitely many p n such that 
Pn+i ~Pn<c. (If 9 > 20/21, one has c{6) = 20.) 



Fix a positive integer k; the basic idea is to try to prove a weak version of the Hardy- 
Littlewood /c-tuples conjecture, for a A;-tuple 7i = (hi, . . . , hk) of distinct integers. Namely, 
we'll try to prove that there are infinitely many n such that at least two of n + hi, . . . , n + 
are prime; this would imply that there are infinitely many prime gaps no greater than 
max7i — min?i. 

To do this, we will try to find an arithmetic function a(n) with nonnegative values, such 
that for j — 1, . . . , k, we can establish 



2 The approach 




(i) 
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If we had such a function, we could sum over j to obtain 



#{1 < j < k : n + hj prime} • a(n) > \_, a ( n ), 




which would immediately imply that for some x < n < 2x, at least two of n + hi, . . . , n + h). 
are prime. (Note: this strategy is poorly adapted to look for three or more primes in the 
same tuple. In fact, no satisfactory alternative has been proposed!) 

Note that if a(n) is supported only on those n for which n + hi, . . . , n + h^ is prime, then 
the /c-tuples conjecture would imply (1), but we have no hope of proving (1) directly. Instead, 
we make a transition that is directly inspired by the transition from the combinatorial sieve 
to the Selberg sieve. 

3 Selberg revisited 

Namely, we pick a cutoff parameter R (which will ultimately depend on k and x), and choose 
a{n) of the form 



for some arithmetic function p with p(l) = 1 and support in {1,...,R}. As in Selberg's 
sieve, we have built the nonnegativity requirement into the construction, and we are now 
free to vary the values of p in order to maximize the ratio between the two sides of (1). 

Unfortunately, we are not in as simple a situation as in Selberg's sieve, where we could 
simply diagonalize a quadratic form to find the desired minimum. In our case, we are 
comparing two different quadratic forms, which cannot be simultaneously diagonalized, and 
hitting the situation with Lagrange multipliers creates a mess. The best we can hope to do 
is to pick p of a special form with at least one parameter left in, run the calculation, and 
then optimize the choice of the parameter (s). 

In Selberg's sieve, the optimal choice would have been 



a(n) 






In our setting, we will instead put 





(2) 



for £ a nonnegative integer depending on k, in a fashion to be specified later. 
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4 Comparing the two sides 

With this choice, one can calculate the two sides of (1) using the sorts of techniques we used 
in the first section of this course. I will postpone those calculations to a later unit, so that 
I can continue giving an overview of the method. First, here is what one gets for the right 
side of (1). 

Lemma 3. With notation as above, there exist C, c > depending on k, £, such that for 
R<x^ 2 /(\ogx) c , 

x h ~( k + WQo&R)™ I i) X{l0gR) + ° { (log R)W ) ■ 

The left side of (1) is more complicated, because of the extra restriction that n + hj 
must be prime. It is on this side that the arithmetic subtleties will creep in. Expanding the 
square, we get 

^ p(d 1 )p(d 2 )i^{x < n < 2x : [di, d 2 ]\(n + hi) ■ ■ ■ (n + h k ),n + hj prime}. (3) 

The count on the right side involves first pinning n down among some number of arithmetic 
progressions modulo the 1cm [di, d 2 ], then looking for primes in that arithmetic progression. 
Thus one expects to be able to approximate (3) by 



x 



E p( d Md2)§^4r,, (4) 

where g is the multiplicative function with g{p) = v n (p) — 1. 

Lemma 4. With notation as above, there exist C, c > depending on k, £, such that for 
R < x 1 / 2 /(\ogx) c , we have the following. 

(a) For h ^H, (4) equals 

&{H,h) (k + £)\ 2 (2£\ x n _ k+2e t ^ fx(\ogx) k+2e - 2 (\og\ogx) 



■(log R) k+M + 



(\ogR) 2k+2e (k + 2£)\\£ J\ogx K ta ' \ (lo gj R) 2fc +^ 

(b) For h eH, (4) equals 

&(H) (k + e)\ 2 {2(l + l)\ x {l ^ R)k+2e+1+0 fx(\ogx) k + 2e -\\og\ogxy 



(\ogR) 2k + 2i (k + 2£+l)\\ £+1 J\ogx K to ' V ( lo S^) 

Crunching the numbers, we see that the ratio between (4) and the right side of (1) is 
asymptotic to 

logi? 2k(2£+l) 



\ogx {£+ \){k + 2£+ 1)' 
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(5) 



The second fraction is always less than 4, but it tends to 4 as k, £ — > oo. Thus if we 
can safely approximate (3) by (4) in the range R < x 1 ^ 2 " 6 , or even R < x 1 / 4+e , we get 
bounded gaps between primes. (Here's where we get stuck looking for three primes in one 
tuple: we can't hope to get past R = x 1 ^ 2 ' 6 because of our earlier errors.) For instance, 
if we could take R = x 1 ^ 2 ' 6 , then already we get (4) with k — 7, £ — 1. Using the 7-tuple 
TL — {11, 13, 17, 19, 23, 29, 31}, one then deduces that there are infinitely many prime gaps 
of size at most 20. 

One can tweak the above argument by changing (2) to allow a polynomial P(\og(R/d) / (log R)) 
instead of just a power. That polynomial must satisfy P(l) = 1 and must vanish to order 
at least k at 0. The quantity analogous to (5) is 

hgR jtSy.r^-yydy 
h % x fo^yP {k) d-y) 2 dy ' 

If we can take R = x 1 ^ 2-6 , then one can get this ratio over 1 already with k = 6, so one gets 
infinitely many prime gaps bounded by 16 (using Ti = {7, 11, 13, 17, 19, 23}) rather than 20. 
But even with the flexibility of choosing P, one can never get the second factor (excluding 
(log /?)/ (log a;)) over 4 (exercise)! 



5 The error terms, first attempt 

None of the above matters unless we can control the discrepancy between (3) and (4). This 
discrepancy is spawned by error terms in the prime number theorem with moduli of the form 
[di, d 2 ] for d±,d 2 < R, so the moduli can run up to R 2 . 

Now recall what we know about these discrepancies. Let ir(x;N,m) be the number of 
primes p < x congruent to m modulo N . If we allow Elliott-Halberstam, then for any fixed 
A > and e > 0, there exists c > such that 



E. 



max 

q<Q 



X 

7i(2x; N, m) — 7r(x; N, m) — 



(f)(N) log a; 



< err (log re)" 



for Q = x 1 ^ 1 . This would allow taking R = x 1 ^ 2 " 6 ; we thus deduce Theorem 2. 

Unconditionally, Bombieri- Vinogradov only allows Q = x 1 ^ 2 ^ 6 . This looks like a disaster: 
we must take R = x 1 ^ 4 " 6 , and so we can never get (1)! What now? 



6 The error terms, second attempt 

Remember that Theorem 1 is a much weaker assertion than the existence of infinitely many 
bounded gaps between primes; there is thus no need to insist on establishing (1) for any 
particular tuple H. Instead, we are free to aggregate over all H in a certain range; to clarify, 
write a(n; Ti) instead of a(n) to indicate the dependence on Ti. 
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Fix 5 > for which we want infinitely many n with p n+1 — p n < H = 5 log x. We will 
now try to prove the inequality 

E E a( n ,n,h)>l E E E ( 7 ) 

H€{l,...,/O fc l<h<H,n+h=p He{l,...,H} k l<h<H,n+h=p x<n<2x 

which again is enough: now we get an n such that at least two of n+1, . . . ,n + h are prime. 

For the right side of (7), Gallagher's result from the previous unit gives us the same 
asymptotics as before, except with &{7~t) replaced by 1 and slightly worse error terms. We 
get an improvement on the left side, which we separate into terms with h ^ 7i and terms 
with h eH. We estimate the latter termse exactly as before; for the former terms, we note 
that if n + h is prime, then 

a(n; H) = a(n; H, h). 

Namely, the difference comes from summands d which divide (n + hi) ■ ■ ■ (n + hk)(n + h) but 
not (n + hi) ■ ■ ■ (n + hk); those are all multiples of n + h > x > R, so p(d) = for such d. 

Thus we can simply appeal back to Lemma 3 with k replaced by k + 1 and TC replaced by 
7i, h. If we now compare the ratio of the two sides of (7), the contribution in the numerator 
from h E H is exactly (5), to which we add the contribution H/(\ogx) = 5 from the terms 
with h H. As noted earlier, that's just enough to get over 1 with R = x 1//2_e and k,£ 
sufficiently large. This yields Theorem 1. 

Exercises 

1. Use the Poisson distribution model to compute a predicted distribution for the ratio 

(Pn+l -Pn)/(l0gPn). 

2. Say we want to produce large gaps between primes. Take iV to be the product of the 
primes up to m, and consider N + 2, . . . , N + m. For what function / does this imply 
p n+ i -p n > f(p n ) for infinitely many n? 

3. Let P be a polynomial with P(l) = 1 vanishing to order at least k at 0. Prove that 
the quantity (6) sans the factor (log R)/(\og x) is at most 4. 
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